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Abstract. We study techniques for deciding the computational complexity of infinite- 
domain constraint satisfaction problems. For certain fundamental algebraic structures 

Cn ' A, we prove definability dichotomy theorems of the following form: for every first-order 

^ , expansion F of A, either F has a quantifier-free Horn definition in A, or there is an element 

d of F such that all non-empty relations in F contain a tuple of the form (d, . . . , d), or all 
relations with a first-order definition in A have a primitive positive definition in F. 

The results imply that several families of constraint satisfaction problems exhibit a 
complexity dichotomy: the problems are in P or NP-hard, depending on the choice of the 

^\i ' allowed relations. As concrete examples, we investigate fundamental algebraic constraint 

satisfaction problems. The first class consists of all first-order expansions of (Q; -f ). The 
second class is the affine variant of the first class. In both cases, we obtain full dichotomies 
by utilising our general methods. 
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1. Introduction 

Constraint satisfaction problems (CSPs) are computational problems that appear in 
almost every area of computer science such as artificial intelligence, graph algorithms, 
scheduling, combinatorics, and computer algebra. Depending on the type of constraints 
that are allowed in the input instances of a CSP, the computational complexity of a CSP 
is usually polynomial (we will call these CSPs tractable), or NP-hard. In the last decade, 
a lot of progress was made to find general criteria that imply that a CSP is tractable, or 
that it is NP-hard. Such results have been obtained for constraint languages over finite 
domains [5tll H[T^ll7j . but also for constraint languages over infinite domains that are u- 
categorical (for formal definition of these concepts see Section [2]). For example, it has been 
shown that for every structure T with a first-order definition in (Q; <) the problem CSP(r) 
is in P if it falls into one out of nine classes, and is NP-hard otherwise [6j. 

Lately, many researchers have been fascinated by a conjecture due to Feder and Vardi [15j 
which is known as the dichotomy conjecture. This conjecture says that every CSP with a 
finite domain constraint language is either tractable (i.e., in P) or NP-complete. According 
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to a well-known result by Ladner [19], there are NP- intermediate computational problems, 
i.e., problems in NP that are neither tractable nor NP-complete (unless P=NP). But the 
problems that are given in Ladner's construction are extremely artificial. The question why 
there are so few candidates for natural NP-intermediate problems is one of the mysteries in 
complexity theory. 

Any outcome of the dichotomy conjecture is probably surprising: a negative answer 
would finally provide relatively natural NP-intermediate problems, which would be of inter- 
est in complexity theory. A positive answer probably comes with a criterion which describes 
the NP-hard CSPs (and it would probably even provide algorithms for the tractable CSPs). 
But then we would have a rich catalogue of computational problems where the computa- 
tional complexity is known. Such a catalogue would be a valuable tool for deciding the 
complexity of computational problems in the mentioned application areas: since CSPs are 
abundant, one might derive algorithmic results by reducing the problem of interest to a 
known tractable CSP, and one might derive hardness results by reducing a known NP-hard 
CSP to the problem of interest. 

In this article, we study two natural classes of infinite domain constraint languages, and 
show that the corresponding CSPs do exhibit a complexity dichotomy. To the best of our 
knowledge, this is the first systematic complexity result for classes of structures that are not 
cj-categorical. The first class consists of all first-order expansions of (Q; {(x, y,z) \ x + y = 
z}) (i.e., we add relations to {Q;{{x,y,z) \ x + y = z}) that are first-order definable in 
{Q;{{x,y,z) \ x + y = z}). The second class is an affine version of the first class, and 
consists of all first-order expansions of (Q; {(a, b,c,d) \ a — b + c = d}). That the structures 
(Q; {(x, y,z) \ x + y = z}) and (Q; {(a, b,c,d) \ a — b + c = d}) are not w-categorical follows 
immediately from the theorem by Engeler, Ryll-Nardzewski, and Svenonius (cf. Theorem 
6.3.1 in [E]). It is even the case that the corresponding CSPs cannot be formulated by any 
cj-categorical template; the basic proof idea is presented in [TJ Proposition 1]; also see [4J. 

Our results follow from theorems about primitive positive definability: we show that for 
every relation R with a first-order definition in (Q;+), either R has a quantifier- free Horn 
definition in (Q; +), or R contains the tuple (0, . . . ,0), or all relations with a first-order 
definition in (Q;+) have a primitive positive definition in (Q;+,i?). The analogous result 
also holds for the affine case. The techniques that we use to prove these two definability 
theorems are more general than the two classification results, and they are very different 
in nature. One technique applies for structures 'that have little structure'; to be precise, 
for all structures F where = and 7^ are the only primitive positive definable non-trivial 
binary relations (Section [5|). In particular, they apply to structures with a 2-transitive 
automorphism group. The other technique applies for structures 'with a lot of structure'; 
informally, it applies whenever we can find a primitive positive definition for the line between 
two points in Q'^ (Section Hj). 

The rest of this paper is organised as follows: in Section [21 we provide some background 
material on constraint satisfaction and logic. A tractability result for templates that have a 
quantifier-free Horn definition in (Q; +) is presented in Section [3l The classification result 
for (Q; +) can be found in Section [4] while the results for the affine case are collected in 
Section [5l Finally, a number of open questions and directions for future work can be found 
in Section [H 
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2. Preliminaries 

Let r = {D; Ri, . . . , Rn) be a relational structurqj with domain D (which will usually be 
infinite) and finitely many relations i?i, . . . , i?„. The constraint satisfaction problem for T 
(short, CSP(r)) is the computational problem to decide whether a given primitive positive 
sentence $ involving relation symbols for the relations in F is true in F. A first-order formula 
is called primitive positive if it is of the form 

3xi,...,Xn.'>pl A... Alpm 

where tpi are atomic formulas, i.e., formulas of the form x = y oi R{xi^, . . . ,Xi^) with R 
the relation symbol for a fc-ary relation from F. We call such a formula a pp-formula. The 
conjuncts in a pp-formula <1> are also called the constraints of $. We also refer to F as a 
constraint language (it is also often called the template) of CSP(F). 

We say that a first-formula (j) defines a relation i? in F when (j){ai, . . . ,0^) holds in F 
iff (ai, . . . , Ofc) € R. If (j) is primitive positive, we call R primitive positive definable (pp- 
definable) over F. The following simple but important result explains the importance of 
primitive positive definability for constraint satisfaction problems. 

Lemma 2.1. Let F be a relational structure and F' be an expansion of this structure by a 
pp-definable relation R over F. Then CSP(F) is polynomial-time equivalent to CSP(F'). 

Lemma F2.1l will be used extensively in the sequel and we will not make explicit references 
to it. Another important class of formulas are Horn formulas; a first-order formula in 
conjunctive normal form is Horn if and only if each clause contains at most one positive 
literal. A relation R is called quantifier- free Horn definable over F if there exists a quantifier- 
free Horn formula that defines i? in F. Note that Lemma 12.11 does not hold if we replace 
'pp-definable' with 'Horn definable'. 

By choosing an appropriate structure F many computational problems that have been 
studied in the literature can be formulated as CSP(F) (see e.g. [UllOKIl]). It turns out 
very often that the structure F can be chosen to be w-categorical. A structure is called 
to -categorical if the set of all first-order sentences that is true in the structure has only one 
countable model, up to isomorphism. A famous example of an w-categorical structure is 
(Q; <). The condition of w-categoricity is interesting for constraint satisfaction, because 
the so-called universal-algebraic approach, which is currently intensively studied for finite 
constraint languages, applies — at least in principle — also for w-categorical structures (see 
e.g. |6j for an application of the universal-algebraic approach to CSPs for constraint lan- 
guages over infinite domains). In this article, we demonstrate that systematic complexity 
classification can be performed for constraint languages over infinite domains even if the 
constraint languages are not w-categorical. 

Example. Let F denote the structure 

(Q; {(x, y,u,v) \ {x = 2y V y = u + v)Axj^ u}) 

It can be shown that CSP(F) cannot be formulated with an cu-categorical template (for 
a very similar proof, see [1]; a necessary and sufficient condition about which CSPs can 
be formulated with w-categorical templates can be found in ^). One can show that the 
relations {{x,y) \ x ^ y} and {{x,y,z) \ x = y + z} have pp-definitions in F. It is 
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now straightforward to determine the computational complexity of CSP(r) by combining 
Lemma |2. II and our classification result (Corollary 14. 6p . 

We will sometimes consider the automorphism group Aut(r) of a template T over a 
domain D, i.e., the group formed by the set of all automorphism^^ of F with respect to 
functional composition. An orbit of Aut{T) on D^ is a set of the form {{a{a),a{b)) \ a € 
Aut{r)}, for some a,b € D. We note that pairs from the same orbit satisfy the same 
first-order formulas. 

Let D be an arbitrary infinite set and arbitrarily choose an element d G D. The 
complexity of CSP(r) where F has a first-order definition in (D; =) (so-called equality 
languages) has been classified in [5j. We note that if R is first-order definable in {D;=) 
and {d, . . . ,d) £ R, then (d', . . . ,d') € R for every d' € D. Thus, the exact choice of d is 
irrelevant when stating the following theorem. 

Theorem 2.2 (of [5j). Let F be a template with a first-order definition in (D; =). Then, 
all relations in F have a quantifier- free Horn definition in (D;=), or all non-empty relations 
in F contain the tuple {d, . . . ,d), or else every first-order definable relation in (D; =) has a 
pp-definition in F. In the last case, CSP(F) is NP-complete. 

Instead of using Theorem 12.21 in its full generality, it will be sufficient to use a simple 
corollary. For any set D, the relation Sd denotes the relation 

{{x, y,z) £ D \y^zA{x = y\/x = z)} . 

Corollary 2.3. Let D be an infinite set. Every first-order definable relation in (D; =) has 
a pp-definition in {D;Sd)- 

Proof. The relation So has a first-order definition in (D; =) and does not contain the tuple 
{d, d, d). It is easy to verify that So has no quantifier- free Horn definition in (D; =) so every 
first-order definable relation in [D] =) has a pp-definition in (D; So) by Theorem 12.21 □ 



3. Tractability 

For all relational structures F with a quantifier-free Horn definition in (Q;+), the 
problem CSP(F) can be solved in polynomial time. This follows from a more general 
algorithmic result in [18] . However, the algorithm presented there solves a linear number of 
linear programs, and thus the best known algorithms have a rather high worst-case running 
time. We present a more efficient algorithm for the special case that is relevant in our paper. 
We denote by 0^{f{N)) the class of all functions of asymptotic growth at most f{N) up 
to poly-logarithmic factors. 

Proposition 3.1. Let F be a relational structure whose relations have a quantifier-free 
Horn definition in (Q;+). Then there is an algorithm that solves CSP(F) in time 0~(A^^) 
where A^ is the size of the input. 

The algorithm we present in the proof of Proposition 13.11 is a combination of general 
techniques in constraint satisfaction [2l[T3] and a polynomial implementation of Gaussian 
elimination algorithm on rational data. Since the input of CSP(F) consists of a primitive 
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positive sentence whose atomic formulas are of the form R{xi, . . . , x^) where R is quantifier- 
free Horn definable over (Q; +), we can as well assume that the input to our problem consists 
of a set of Horn clauses over (Q; +). 

We have to make some remarks about the worst-case running time of the Gaussian 
elimination algorithm. It is well-known that the Gaussian elimination requires 0{n?'m) 
many arithmetic operations on rational numbers, where m is the number of equations and 
n is the number of variables. In our algorithm, we have to solve a linear number of linear 
equation systems Si,. . . , Sm', however, system Si+i is obtained from system Si by adding a 
single linear equation. Since the Gaussian algorithm can be presented in such a way that it 
computes a system in triangular form, adding successively equation by equation, the overall 
costs for solving ^i, . . . , Sm equals the cost to solve Sm with Gaussian elimination. 

Also recall that the size of the numbers involved when performing the Gaussian elim- 
ination algorithm might grow exponentially when implemented without care. However, 
when we use the Euclidean algorithm to shorten the coefficients during the elimination 
process, the Gaussian elimination algorithm can be shown to be polynomial [14]. We are 
only interested in deciding solvability of linear equation systems, and not constructing so- 
lutions, and so we even have linear bounds (in the input size) on the representation size 
of all numbers involved in deciding solvability for linear equation systems over the rational 
numbers with Gaussian elimination (see p2], proof of Theorem 3.3). Finally we remark 
that the most costly arithmetic operation that has to be performed on rational numbers 
during the elimination process is multiplication, and multiplication can be performed in 
time 0(s log slog logs), where s denotes the representation size of the two rational num- 
bers (in bits). Hence, the overal running time for solving Si, . . . ,Sm with the discussed 
implementation of the Gaussian elimination algorithm is in 0^{N'^). 

We will show that our algorithm for CSP(r) can be implemented such that it has the 
same overall asymptotic worst-case complexity. 



Solve($) 

// Input: An instance $ of CSP(r) 

// where all relations in F have a quantifier-free Horn definition in (Q; -I-) 

// Output: yes if $ is true in T, false otherwise 

Let C be the set of all Horn-clauses from each constraint in $ 

Let U be the subset of C that only contains clauses with a single positive literal. 

Do 

For all negative literals -10 in clauses from C 

lilA implies delete the negative literal -10 from all clauses in C. 

If C contains an empty clause then return unsatisfiable. 

If C contains a clause with a single positive literal ip, add {?/'} to U. 
Loop until no literal has been deleted 
Return satisfiable. 



Figure 1 : An algorithm for the constraint satisfaction problem where all constraint relations 
have a quantifier-free Horn definition in (Q; -|-). 



Proof of Proposition \3.1[ We first discuss the correctness of the algorithm shown in Figured! 
and then explain how to implement the algorithm such that it achieves the desired running 
time. 
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When hi logically implies (j) then the negative literal -^(p is never satisfied and can be 
deleted from all clauses without affecting the set of solutions. Since this is the only way 
how literals can be deleted from clauses, it is clear that if one clause becomes empty the 
instance is unsatisfiable. 

If the algorithm terminates with yes, then no negation of a disequality is implied by lA. If 
r is the rank of the linear equation system defined by lA, we can use the Gaussian elimination 
algorithm as described above to eliminate from all literals in the remaining clauses r of the 
variables. Let S be the maximal sum of the absolute values of all coefficients in one of the 
remaining inequalities plus one. Then setting the i-th variable to S* satisfies all clauses. 

To see this, take any disequality, and assume that i is the highest variable index in this 
disequality. Order the disequality in such a way that the variable with highest index is on 
one side and all other on the other side of the 7^ sign. The absolute value on the side with 
the i-th variable is at least S"*. The absolute value on the other side is less than S* — S, since 
all variables have absolute value less than S*"^ and the sum of all coefficients is less than 
5 — 1 in absolute value. Hence, both sides of the disequality have different absolute value, 
and the disequality is satisfied. Since all remaining clauses have at least one disequality, all 
constraints are satisfied. 

We finally explain how to implement the algorithm such that it runs in time 0"~'{N^). To 
decide whether U implies an equality (j), we compute in each interation of the main loop the 
triangular normal form for the linear equation system determined by lA as described before 
the statement of the Proposition. The overall costs to do this are in 0~(A^^). Moreover, 
for each negative literal we maintain an equation where we eliminate as many variables as 
possible using the computed triangular normal form. If one of these equations becomes 
trivial (i.e. is the form a = a) we conclude that the equation is implied by lA. The overall 
costs for doing this is also bounded by 0^{N'^) by a very similar argument as given before 
the statement of the proposition. With appropriate straightforward data structures, the 
total costs for removing negated literals -i(/> from all clauses when (j) is implied by U is 
linearly bounded in the input size since each literal can be removed at most once. □ 



4. The Rational Numbers with Addition 

In this section we present the complexity classification for first-order expansions of 
(Q; {{x,y,z) \ X + y = z}). We begin in Section WA\ with a result about the pp-definability 
of the disequality relation 7^ in first-order expansions of (Q; {(x, y,z) \ x + y = z}). When 
the relation 7^ is pp-definable, we show that also the relation Sq (defined in Section [2] as the 
relation {{x,y,z) G Q'^ \ x ^ z f\{x = y\J y = z)}) is pp-definable whenever the constraint 
language contains a relation R that is first-order, but not quantifier- free Horn definable in 
(Q;+); this is shown in Section l4.2i Finally, Section 14.31 completes the classification for 
first-order expansions of (Q; {{x, y,z) \ x + y = z}). 

4.1. Definability of Disequality. 

Lemma 4.1. For any structure F with a first-order definition in (Q;+), the first-order 
definable relations in F are a subset of {Q, Q \ {0}, {0}, 0}. 

Proof. Let i? be a unary relation with a first-order definition in (Q; -|-). The statement is 
clear if R does not contain any element distinct from 0, so let a be from Q \ {0}. We have to 
show that R = Q or R = Q\ {0}. Observe that for any c G Q, c 7^ 0, the mapping x 1-^ ex 
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is an automorphism of T. Hence, for any b ^ there is an automorphism of (Q; +) that 
maps a to b. Since automorphisms preserve first-order formulas, so 6 G i? and the claim 
follows. □ 

Note that x = is equivalent to a; + x = a; and hence the relation {0} is pp-definable 
over (Q; +); thus we can use freely as a constant symbol in pp-definitions over T. 

Proposition 4.2. Let F be a first-order expansion of (Q; -|-) containing a non-empty relation 
R such that R{x, . . . ,x) is false for any x. Then ^ is pp-definable in F. 

Proof. Observe that if the set Q \ {0} has a pp-definition </>(«) in F, then the pp-formula 

3u, y' . (p{u) Ay + y' = OAx + y' = u 

defines x ^ y over F. 

Let 5" be a non-empty pp-definable relation in F of minimal arity such that S{x, . . . ,x) 
defines the empty set. Let k be the arity of 5. First, assume that S{xi,X2, ■ ■ ■ , x^) Axi = X2 
is satisfiable. Then the (A; — l)-ary relation S'{x2, ■ ■ ■ , Xk) defined by S{x2,X2, • • • , Xk) is non- 
empty, and S"(x, . . . , x) defines the empty set; this is in contradiction to the choice of S. 

Assume next that 5(xi, . . . ,Xfc) A xi = X2 is unsatisfiable. Define the unary relation 
r(x) by 

3X3,...,XA;.5(X,0,X3,... ,Xfc) 

and the unary relation U{y) by 

3x1, X3, ■■■,Xk. S{xi,y, X3, . . . , Xk) ■ 

By Lemma [4. H both T and U are from {Q, Q \ {0}, {0}, 0}. The relation T cannot be equal 
to {0} or to Q since this contradicts the assumption that 5(xi,X2, • • • ,Xk) A xi = X2 is 
unsatisfiable. If T is equal to Q \ {0}, then by the initial observation ^ is pp-definable in F 
and we are done. We conclude that T = and hence ^ U. Since U is non-empty, it must 
be the case that C/ = Q \ {0}, and again by the initial observation ^ is pp-definable in F. □ 

4.2. Definability of Sq. The rational numbers with addition (and also the real numbers 
with addition) admit quantifier elimination, i.e., every relation with a first-order definition in 
(Q; +) also has a quantifier- free definition over (Q; -|-). This follows from the more general 
fact that the first-order theory of torsion-free divisible abelian groups admits quantifier 
elimination (see e.g. Theorem 3.1.9 in [20j). 

The first lemma allows us to freely use certain expressions in pp-definitions over (Q; -|-). 

Lemma 4.3. The relation {(xi, . . . ,xi) \ rixi + . . . + rixi = 0} is pp-definable in (Q; -|-) for 
arbitrary ri, . . . , r/ G Q. 

Proof. First observe that we can assume that ri, . . . , r^ are integers, because we can multiply 
the equation rixi -|- • • • -|- rixi = by the least common multiple of the denominators of 
ri,...,n and obtain an equivalent equation. The proof is by induction on I. We first 
consider that case that 1 = 1. If ri = 0, there is nothing to show. Otherwise, the formula 
rixi = is equivalent to xi + xi = xi. Hence, we can in particular use expressions of the 
form X = and x -|- y = in pp-definitions over (Q; +) with variables x, y. If Z = 2, and 
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ri = or r2 = 0, then we can argue as in the case 1 = 1. If ri and r2 are both positive or 
both negative, then rixi + r2X2 = is equivalent to 

3tii , . . . , Ur^ ,Vi, . . . ,Vr2-Ul = Xi A Ui = X2 A Ur^ + W^j = A 

ri — 1 r2 — 1 

f\ Xi + Ui = Ui+l A /\ X2 + Vi = Vi+l . 

1=1 j=l 

If ri and r2 have different signs, we replace the conjunct u^ +Vr2 =0 in the formula above 
by Ur^ = Vr2 ■ 

Now suppose that / > 2. By the inductive assumption, there is a pp-definition (pi for 
riXi+r2X2+u = and a pp-definition (p2 for r3X3+. . . rixi+v = 0. Then 3u, v.(j)iA(j)2/\u+v = 
is a pp-definition for rixi + • • • + rixi =0. D 

In the following, R denotes a relation with a quantifier-free first-order definition (p in 
(Q; +). A quantifier-free first-order formula (j) in conjunctive normal form is called reduced 
if every formula obtained from (j) by removing a literal is not equivalent to 4> (this concept 
was introduced in [3j). Clearly, such a reduced definition of R always exists, because we 
can find one by successively removing literals from (p. Note that if / is a literal from (p, then 
-i/ can be written as a pp-formula over a structure that contains ^ and -|-. 

Lemma 4.4. If R is first-order, but not quantifier-free Horn definable in (Q; -|-), then Sq 
has a pp-definition in (Q; R,+,j^). 

Proof. Let T{x, y) C Q^ be the binary relation defined byx7^0A(y = 0Va; = y). We first 
prove that T has a pp-definition in (Q; i2, -|-, 7^). Let </> be a reduced first-order definition 
of R, and let C be a clause of (p with two positive literals li and I2. Because (p is reduced, 
there are p,q & R such that p satisfies /i and does not satisfy all other literals in C, and q 
satisfies I2 but does not satisfy all other literals in C. 

We claim that the following pp-formula is logically equivalent to x ^ OA{y = OVx = y). 

k 

3zi,...,Zk. x/0 A /\zi=piX + {qi-pi)y A 

i=l 

/\ ^l A R{zi,...,Zk) 
iec\{h,i2} 
Let X / be arbitrary. Suppose that y = 0. Then the assignment zi = pix, . . . ,Zk = PkX 
obviously satisfies the first line in the pp-formula. Recall that p € R and p does not satisfy 
all literals in C except for li. The function /(a) = x • a is in Aut(Q; -|-) whenever x 7^ 0. 
Consequently, / € Aut(Q;i?), too, and the second line in the formula is satisfied as well. 
Now suppose that x = y. Then the assignment zi = gix, . . . , z^ = qux obviously satisfies 
the first line in the pp-formula. By construction, q G R and q does not satisfy all literals in 
C except for li. Again we conclude that the second line in the formula is also satisfied. 

For the opposite direction, suppose that x,y £ Q satisfy the pp-formula. Because of 
the first line of the formula, x 7^ 0. Let zi,. . . ,Zk be the k elements whose existence is 
asserted in the first line of the formula. Note that the equations of the first line imply that 
(21, ... , Zfc) lies on the line L C Q'^ defined by px and qx. Because the formula contains 
the conjunct R{zi, . . . , z^), the clause C in i;^ is satisfied hy zi, . . . , z^. Since zi, . . . ,Zk also 
satisfies the conjunction of all negated literals in C except for the positive literals ^i and I2, 
at least one of these two literals li and I2 must be satisfied by zi, . . . , z^. 
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Suppose first that li is satisfied. The Hne L does not lie completely within the subspace 
of Q*^ defined by h (because q does not satisfy li, and neither does qx). Hence, L intersects 
this subspace in at most one point. Because p and hence also px & L satisfies ^i, we have 
thus shown that (zi, . . . , z^) equals px. Since p ^ q we conclude that y = by the equations 
in the second line of the formula. Now, consider the case that I2 is satisfied. Similarly as in 
the last case, L intersects the subspace defined by I2 in at most one point. Because q & L 
satisfies I2, we have shown that {zi, . . . ^Z}^) equals q. The equations in the second line of 
the formula then imply that x = y. 

Finally, we prove that Sq{u,v,'w) has the following pp-definition in (Q; +,T): 

3x,y. X + V = w A y + v = u A T{x,y). 

Suppose first that {u, v, w) G Sq. Note that x = w — v is not equal to because v ^ w. If 
u = V, then y = 0, and ii u = w, then x = w — v = u — v = yso T{x, y) is satisfied. 

Conversely, suppose that (x, y) G Q^ satisfies the pp-formula above. The formula 
T(x, y) implies that x 7^ and hence w ^ v. Moreover, T{x, y) implies that y = ov x = y. 
If y = 0, then u = V and (n, v, w) G Sq. If x = y, then w — v = u — v and hence u = w. 
Again {u,v,w) is in Sq. D 

4.3. Classification Result. We will now use Lemma 14.41 in order to prove the following 
definability result. 

Theorem 4.5. Let F be first-order expansion of (Q; +). Then, either 

• each relation in F has a quantifier-free Horn definition in (Q; -|-), or 

• every non-empty relation of F contains a tuple of the form (0, . . . , 0), or 

• every first-order definable relation in (Q; -1-) has a pp-definition in F. 

Proof. Suppose that there is a non-empty /c-ary relation R oiV that does not contain the 
tuple (0, . . . , 0). Then the {k + l)-ary relation R'{xi, . . . , Xk+i) defined by R{xi, . . . , x^) A 
Xk+i = is non-empty, and the relation defined by R'{x, . . . , x) is empty. So we can apply 
Proposition 14.21 and find that 7^ is pp-definable in (Q; -|-, R') and hence also in F. So assume 
in the following without loss of generality that F contains the relation 7^. 

Suppose that one of the relations of F does not have a quantifier-free Horn definition in 
(Q;+). Lemma 14.41 implies that the relation Sq has a pp-definition in F, and Corollarv 12.31 
implies that every relation with a first-order definition in (Q; =) has a pp-definition in F. 

Let -R be a relation with a first-order definition (p in (Q; +). To find a pp-definition for 
R in F, we introduce a variable u for every atomic formula of the form x + y = z in (p. For 
each atomic formula -(/^ in (/> of the form x-|-y = 2;, we replace iphy u^ = z for a new variable 
u^. The resulting formula consists of a boolean combination of atomic formulas of the form 
X = y, which we know has a pp-definition <j)' inT. For each atomic formula ijj in (p we add 
the conjunct x -|- y = n^ to (j)' , and finally existentially quantify over all new variables. It is 
straightforward to verify that the resulting formula is a pp-definition of i? in F. □ 

Theorem 14.51 has immediate consequences for the computational complexity of con- 
straint satisfaction. 

Corollary 4.6. Let F be a structure with a finite relational signature and a first-order 
definition in (Q; -|-) that contains the relation {(x, y,z) \ x + y = z}. Then CSP(F) is in P 
if all relations in F have a quantifier- free Horn definition over (Q;-|-), or if all non-empty 
relations contain a tuple of the form (0, . . . , 0), and is NP-hard otherwise. 
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Proof. If all relations in F have a quantifier- free Horn definition over (Q; +), then Proposi- 
tion [STT] implies that CSP(r) is in P. Otherwise, Theorem 14.51 implies that in particular the 
relation defined hy {x = y Ay y^ z) V (x y^ y Ay y^ z) is pp-definable in F. It follows from 
Theorem 12.21 that the constraint satisfaction problem for this ternary relation is NP-hard. □ 



5. Affine Structures over the Rational Numbers 

We will now consider affine additive structures over Q. The structure of this section 
is very similar to the structure of Section U) we begin by studying the definability of ^ 
(Section 15. ip and of Sd (in Section 15. 2p and use these results to completely classify the 
problem in Section [5.31 The main proof in Section [5.21 however, is very different from the 
corresponding proof in Section 14.21 

Let us now formally define the problem at hand: define the operation / : Q^ ^ Q by 
f {a, b, c) = a — b + c. We study the constraint satisfaction problem for templates F with a 
first-order definition in (Q; /) that contain the relation {(a, b, c, d) \ a — b + c = d}. 

5.1. Definability of Disequality. 

Lemma 5.1. Let F be a structure with a first-order definition in (Q;/). Then there are 
at most four first-order definable binary relations: the empty relation, the full relation, the 
relation ^, and the relation =. 

Proof. It suffices to show that Aut(F) has precisely two orbits on M^, namely 

Oi = {{x, x) \x eQ} and O2 = {{x, y) | x, y e Q, x / y} . 

These two orbits clearly partition Q-^. It is obvious that Oi is an orbit, because for every 
c G Q the mapping x 1— )• x + c is an automorphism of (Q; /) and hence of F. To see that O2 
is an orbit of pairs of reals, we apply linear interpolation: let (a, b) G O2 and (c, d) G O2 be 
arbitrary. The mapping x 1-^ ^^{x — a) + c maps (a, b) to (c, d) and it is an automorphism 
of (Q;/), and hence of F. D 

In the proof of Lemma 15.11 we have in fact verified that the automorphism group of F 
is 2-transitive, i.e., that there is only one orbit of pairs of distinct elements with respect to 
the componentwise action of the automorphism group of F on pairs. 

Theorem 5.2 (from [6J). Let F be a relational structure with a 2-transitive automorphism 
group. If there is no pp-definition of 7^, then there is an element x of F such that every 
non-empty relation in F contains a tuple of the form (x, . . . , x). 

5.2. Definability of Sd- The central step of the classification is the following result con- 
cerning pp-definability. 

Lemma 5.3. Let F be a relational structure over an infinite domain D such that D^, =, 7^, 
and are the only pp-definable binary relations. Suppose that F contains a relation Q such 
that there are pairwise distinct 1 <i,j,k,l <n for which the following conditions hold: 

(1) Q{xi, . . . , Xn) A Xi ^ Xj is satisfiable; 

(2) Q{xi, . . . , Xn) Axk y^ xi IS satisfiablc; 

(3) Q{xi, . . . , Xn) Axi ^ Xj A Xk y^ xi is unsatisfiable. 
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Then Sd has a pp-definition in F. 

We simphfy the proof of Lemma 15.31 by first proving a shghtly restricted version: 

Lemma 5.4. Let L be a relational structure over an infinite domain D such that D'^, =, 
7^, and are the only pp-definable binary relations. Suppose that T contains a relation Q 
such that there are 1 < i,j,k < n for which the following conditions hold: 

(1) Q{xi, . . . , Xn) A Xj 7^ Xj is satisfiable; 

(2) Q{xi, . . . , Xn) Axi ^ Xk is satisfiable; 

(3) Q{xi, . . . , Xn) A Xj 7^ Xj A Xi y^ Xk is unsatisfiable. 
Then So has a pp-definition in T. 

Proof. The indices i,j, k must be pairwise distinct, so suppose for the sake of notation that 
i = 1, j = 2, k = 3. Consider the relation R defined by 

R{xi,X2,X3) = 3X4, • • • , X„.Q(xi, . . . , X„) A X2 / X3 . 

We first note that i? is a non-empty relation: Q{xi, . . . , x„) is satisfiable so the only way of 
making R empty is that every tuple (si, . . . , s„) in Q satisfies S2 = S3. This is impossible 
since we know that there exists a tuple (si, . . . ,s„,) G Q such that si 7^ ■S2- This implies 
•si 7^ S3 and contradicts the third condition. 

Arbitrarily choose a domain element a. We first show that there always exist elements 
y,z G D such that (a,y,z) € R. Let A = {a £ D \ 3y,z.R{a,y,z)} and note that A 
is pp-definable. We know that A is non-empty since R is non-empty. Now assume that 
A C. D. First suppose that |^| = 1. Then 

A'{x,y) = A{x)AA{y) 

is non-empty and a strict subset of the equality relation, a contradiction. 
If \A\ > 1, then consider the pp-definable relation 

A'{x,y)=A{x)AA{y)Ax^y. 

We see that ^ '^ A' CI {{u,v) £ D^ \ u ^ v} which contradicts the fact that the only 
non-trivial binary relations that are pp-definable from F are = and 7^. Hence, A = D. 

We now continue by considering the tuple (a, y, z) G R. By the third condition, we 
see that at least one of y, z must equal a in order to satisfy R. Let us consider the case 
R{a, a, z). Note that (a, a^a) ^ R due to the literal y i^ z. We now show that R{a^ a, z) is 
satisfied by any choice of z except a. To see this, assume to the contrary that there is a 
domain element b ^ a such that {a,a,b) R. Define R'{x,z) = R{x,x,z) and note that 
C i?' C {(u,v) € D^ \ u ^ v} which contradicts the assumption that = and 7^ are the 
only non-trivial pp-definable binary relations. Similarly, one can show that R{a, y, a) holds 
for all y ^ a. Therefore R = So- D 

Proof of Lemma \5.3[ Assume for notational simplicity that i = l,j = 2,k = 3, and / = 4. 
Define the 4-ary relation R by 

i?(xi, X2, X3, X4) = 3X5, • • • , Xn-Qixi, ...,Xn) 

and consider the formula (j) = R{x , y , x' , y') A R{z' , y' , z,y) Ax' 7^ z' . We claim that (pAx^y 
and (j) Ay ^ z are satisfiable while (pAxj^yAy^zis not satisfiable. Then we can apply 
Lemma 15.41 and are done. First we make an observation: 

Observation 1. Define relation i?i such that 
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Ri{u, v) = 3x, y.R{x, y,u,v) A x ^ y. 
We know that R{x, y,u,v) A x ^ y is satisfiable so Ri is a non-empty relation. Since 
Ri{u, v) Au ^ V is not satisfiable, we conclude that Ri is a non-empty subset of the equality 
relation. Consequently, Ri is the equality relation. Analogously, define R2 such that 

R2{u, v) = 3z, y.R{u, v,z,y) Az ^y 
and note that R2 is the equality relation, too. 

We now prove that (pAx^yAy^zis not satisfiable. By using Observation 1, it follows 
that any solution s satisfies x' = y' and y' = z' — this is impossible due to the clause 
x' / z'. 

Next, we prove that (p A x ^ y is satisfiable; the case (p Ay ^ z is symmetric. Consider 
the relation 

U{u, v) = 3w.R{w, u,v,v) Aw ^ u. 
By the conditions on R, we know that U is non-empty. Since U is binary, we also know that 
U either is the equality relation, the disequality relation, or the full relation. We conclude 
that U is non-empty and symmetric. 

By Observation 1, the clause x ^ y has the effect that every solution s must satisfy 
x' = y' . The solution also has to satisfy x' 7^ z' which implies that y' 7^ z' . Observation 1 
now tells us that z = y and we conclude that every solution satisfies x' = y' and z = y. We 
define 

(/)' = R{x, y, x', x') A R{z', y' , z,z) A x' ^ z' A x ^ y 
Thus, (p' is satisfiable if and only if (pAx ^ y is satisfiable. We will now construct a concrete 
satisfying assignment s to the variables of <p' . 

Arbitrarily choose a tuple (a, b) G U and let s{y) = a, s{x') = b. By the conditions on U, 
there exists an element c such that (c, a, b,b) £ R and c 7^ a; we let s{x) = c. Furthermore, 
we know that s(x') = s{y') and s{z) = s{y) so s{y') = b and s{z) = a. At this point, we see 
that the assignment s satisfies the clauses R{x,y,x' ,x') and x ^ y. 

We know that (a, b) G U so (6, a) € U, too, and there exists a value d such that 
{d,b,a,a) G R and d ^ b. Now, let s{z') = d and note that R{z',y' ,z,z) is satisfied by s. 
Finally, s(x') = b ^ d = s{z') so the clause x' ^ z' is satisfied and the proof is completed. D 

5.3. Classification Result. We are now ready to prove the classification result for the 
affine case. 

Theorem 5.5. Let F be a first-order expansion of (Q; /). Then, either 

• each relation in F has a quantifier-free Horn definition in (Q; /), or 

• every non-empty relation of F contains a tuple of the form (0, . . . , 0), or 

• every first-order definable relation in (Q; /) has a pp-definition in F. 

Proof. Suppose that there is a non-empty A:-ary relation i? of F that does not contain the 
tuple (0, . . . , 0). The proof of Lemma 15.11 shows that F is 2-transitive, and hence by the 
contraposition of Theorem 15.21 the relation 7^ is pp-definable. So assume in the following 
without loss of generality that F contains the relation 7^. 

Let -R be a relation in F that does not have a quantifier-free Horn definition in (Q; /). 
Let (p{xi, . . . , Xn) be a reduced definition of R in (Q; /) (see SectionH]). Then there must be a 
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clause C incp with at least two positive literals /(xj^ , Xi^ , Sjg ) = Xj^ and f{xj^ , Xj^ , Xj^ ) = Xj^ . 
Let Q{xi, . . . , x„, x„+i, x„+2) be the relation defined by 

(/)(xi,...,x„) A Aiec\{hM"^^ 

Xn+l = / v^ii ) ^42 ) •^*3 J '^ Xn+2 ^ J K-^ju ■^J2T ■^ja)' 

This relation Q is clearly pp-definable over (Q;i?, /, 7^). We claim that Q satisfies 
the conditions of Lemma 15.31 (which is applicable due to Lemma 15. ip with respect to the 
arguments indexed by 14,12 -\- 1,J4, and n + 2 (or the conditions of Lemma 15.41 if i^ = j^; 
this remark also applies to all other places where we appeal to Lemma I5.3P . Since <j) is 
reduced, there is a tuple t ^ R that satisfies I2 and does not satisfy all other literals in 
(j). Now, the extended tuple ti = (t[l], . . . , t[n], t[i4], t[j4]) clearly satisfies Q, and we have 
ti [ii\ ^ ti\n + 2] as required in the conditions for Lemma 15.41 There is also a tuple t2 £ R 
that satisfies h and does not satisfy all other literals in C, and we can argue similarly to 
find a second tuple showing the second condition of Lemma 15.31 

Finally, suppose for contradiction that there is a tuple ^3 in Q where t^lii] 7^ t^ln + 1] 
and t3[J4] ^ t^ln + 2]. Because this tuple satisfies in particular the clause C from cp, the 
conjunct A;ec\{/ l I ~'^ implies that either /i or I2 is satisfied. But then the equalites Xn+i = 
f{xi^ , Xi2 , Xi3 ) and x„+2 = f{xj-^ , Xj^ , Xj.^ ) imply that is [^4] = ts ["- + 1] or is [J4] = ^3 [n + 2] , a 
contradiction. Hence, Lemma 15.41 applies. Sk is pp-definable over (Q;(5) and therefore also 
over (Q; i?, /, 7^) and T. The result follows from Corollary 12.31 □ 

The next corollary is a direct consequence of Proposition l3.lt Theorem 15.51 and Corol- 
lary! 



Corollary 5.6. Let T be an expansion of (Q; {(a, b,c,d) \ a — b + c = d}) by finitely many 
first-order definable relations. If each relation in T has a quantifier- free Horn definition in 
(M, /), or if each non-empty relation contains a tuple of the form (0, . . . ,0), then CSP(r) 
is in P. Otherwise, CSP(r) is NP-hard. 



6. Concluding Remarks 

We have presented classification results for certain algebraic constraint satisfaction 
problems, and the results are to a large extent based on dichotomy results for logical defin- 
ability. We feel that the results and ideas presented in this paper can be extended in many 
different directions. Hence, it seems worthwhile to provide some concrete suggestions for 
future work. 

The results and proof techniques in Section H] appear to be generalisable to many dif- 
ferent templates defined over various structures. One example is the natural and important 
class of structures that are definable in Presburger arithmetics |21] . i.e., structures that are 
first-order definable over the integers with addition (Z; +). We note that the following can 
be obtained by slightly modifying Corollary 14.61 



Corollary 6.1. Let F be a relational structure with a quantifier-free first-order definition in 
(Z; -|-) that contains the relation {(x, y,z) \ x + y = z}. Then CSP(F) is in P if all relations 
in F have a quantifier- free Horn definition over (Z; +), or if all non-empty relations contain 
a tuple of the form (0, . . . , 0). Otherwise, CSP(F) is NP-hard. 
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There is an important difference between this result and a full classification result: 
we have replaced first- order definability with quantifier-free first- order definability in the 
statement of the result, and the reason is that (Z; +) does not admit quantifier elimination. 
Is there still a complexity dichotomy if we look at the class of CSPs with an template that 
is first-order definable in (Z; +)? This appears to be a difficult question. 

The results presented in Section 5 have strong connections with earlier work on the 
complexity of disjunctive constraints l8l[T3]. We say that ^ is 1-independent with respect 
to a T-structure T if and only if for every primitive positive r-formula (j) with free variables 
X, y, z, w the following holds: if (j) A x ^ y and (p A z y^ w are satisfiable, then so is (/> A x 7^ 
y A z ^ w. Assume that CSP(r) is tractable and let V denote the set of all relations that 
can be defined by (quantifier-free) conjunctions of disjunctions over T containing at most 
one literal that is not of the form x ^ y. The following has been shown in [8l|T3]; it does 
not imply our result since it only makes a statement about a constraint language T' of the 
form described above. 

Theorem 6.2 (from [UlTS]). Let F and V be defined as above, and assume that P 7^ NP. 
Then CSP(r') is tractable if and only if 7^ is 1-independent with respect to T. 

We have already mentioned that the structures studied in this paper are in general 
not w-categorical. However, torsion-free divisible abelian groups such as (Q; -|-) and all 
structures first-order definable in such groups are strongly minimal (see e.g. Corollary 3.1.11 
in [20]), and hence categorical in all uncountable cardinals. This is interesting from a 
constraint satisfaction point of view because of the following preservation theorem. 

Theorem 6.3 (of [4J). Let T be an uncountably categorical structure with a countable 
relational signature and an uncountable domain. Then a first-order definable relation R 
has a pp-definition in P if and only if R is preserved by all infinitary polymorphisms of P. 

Note that this theorem is weaker than the corresponding theorem for w-categorical 
structures [7], because we have to assume that R is first-order definable, and that R is not 
only preserved by the finitary, but also by the infinitary polymorphisms of P. Since our 
classification result is purely in terms of primitive positive definability of first-order defin- 
able relations, it is an interesting question to describe the polymorphisms that guarantee 
tractability for structures P with a first-order definition in (Q; -|-) (Theorem 16.31 shows that 
such polymorphisms do exist). 
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